PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Pavir.5NG049400.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Panicodae; Paniceae; Panicinae; Panicum
Family HD-ZIP
Protein Properties Length: 849aa    MW: 91984.2 Da    PI: 5.957
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Pavir.5NG049400.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox57.91.7e-181573357
                         --SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHC....TS-HHHHHHHHHHHHHHHHC CS
             Homeobox  3 kRttftkeqleeLeelFeknrypsaeereeLAkkl....gLterqVkvWFqNrRakekk 57
                         k  ++t+eq+e+Le+l++++++ps  +r++L + +    +++ +q+kvWFqNrR +ek+
  Pavir.5NG049400.1.p 15 KYVRYTPEQVEALERLYYECPKPSSLRRQQLVRDCpvlaNVDPKQIKVWFQNRRCREKQ 73
                         6789*****************************************************97 PP

2START176.51.6e-551623692204
                          HHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS.SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-SEEEEEEEECTT..E CS
                START   2 laeeaaqelvkkalaeepgWvkssesengdevlqkfeeskvdsgealrasgvvdmvlallveellddkeqWdetlakaetlevissg..g 89 
                          +aee+++e+++ka+ ++  Wv+++ +++g++++ +++ s+++ g a+ra+g+v m++a  v+e+l+d++ W ++++++e+++v+  g  g
  Pavir.5NG049400.1.p 162 IAEETLTEFLSKATGTAVEWVQMPGMKPGPDSIGIIAISHGCAGVAARACGLVGMEPA-KVAEILKDRLLWLRDCRSMEVVNVLPAGnnG 250
                          789*******************************************************.8999999999******************9** PP

                          EEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--....-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SS CS
                START  90 alqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppe...sssvvRaellpSgiliepksnghskvtwvehvdlkgr 175
                          +++l +++l+a+++l+p Rdf+ +Ry+  l +g++v++++S++s+q  p+    ++++R+e+lpSg+li+p+++g+s +++v+h+dl+ +
  Pavir.5NG049400.1.p 251 TIELLYMQLYAPTTLAPaRDFWLLRYTSILDDGSLVVCERSLSSKQGGPSmppVQPFIRGEMLPSGFLIRPSDGGGSIIHIVDHMDLEPW 340
                          **********************************************9998888899********************************** PP

                          XXHHHHHHHHHHHHHHHHHHHHHHTXXXX CS
                START 176 lphwllrslvksglaegaktwvatlqrqc 204
                          ++++++r+l++s+++ ++kt +a+l++++
  Pavir.5NG049400.1.p 341 SVPEVVRPLYESSAMVAQKTSMAALRYLR 369
                          *************************9986 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5007115.3371074IPR001356Homeobox domain
SMARTSM003891.2E-141278IPR001356Homeobox domain
PfamPF000464.1E-161573IPR001356Homeobox domain
CDDcd000862.02E-151575No hitNo description
SuperFamilySSF466892.44E-161676IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.601.3E-171773IPR009057Homeodomain-like
CDDcd146861.11E-567104No hitNo description
PROSITE profilePS5084825.241152371IPR002913START domain
CDDcd088751.50E-74156370No hitNo description
Gene3DG3DSA:3.30.530.206.1E-24159365IPR023393START-like domain
SuperFamilySSF559614.94E-37161371No hitNo description
SMARTSM002343.9E-47161371IPR002913START domain
PfamPF018523.5E-53162369IPR002913START domain
PfamPF086701.4E-49701844IPR013978MEKHLA
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
GO:0008289Molecular Functionlipid binding
Sequence ? help Back to Top
Protein Sequence    Length: 849 aa     Download sequence    Send to blast
MVTAKEAAAA MDASKYVRYT PEQVEALERL YYECPKPSSL RRQQLVRDCP VLANVDPKQI  60
KVWFQNRRCR EKQRKESSRL QALNRKLTAM NKLLMEENDR LQKQASQLVY ENGYYRQQTQ  120
SAGLTTTDTS CESVVTSGQQ NVAGAQPQAQ PRDASPAGLM SIAEETLTEF LSKATGTAVE  180
WVQMPGMKPG PDSIGIIAIS HGCAGVAARA CGLVGMEPAK VAEILKDRLL WLRDCRSMEV  240
VNVLPAGNNG TIELLYMQLY APTTLAPARD FWLLRYTSIL DDGSLVVCER SLSSKQGGPS  300
MPPVQPFIRG EMLPSGFLIR PSDGGGSIIH IVDHMDLEPW SVPEVVRPLY ESSAMVAQKT  360
SMAALRYLRQ VAHEDTHSVT GWGRQPAALR ALSQKLTRGF NEALNGLADD GWSVIESDGV  420
DDVCISVNSS PSKVINCNSA FNNGLPIVSS SVLCAKASML LQDVSPAALL RFMREQRSQW  480
ADNNLDAFFA SAMKPNFCNL PMSRLGGFSG QVILPLAHTF DPEEFLEVIK LGNASTYQDA  540
LIHRDLFLLQ MYNGVDENTV GTCSELIFAP IDASFSDDSP LLPSGFRIIP IDSPLDTSSS  600
KCTLDLASTL EVGTPRSRIS GSGSGNAACA SSKAVMTIAL QFAFESHLQD SVATMARQYM  660
RSIIASVQRI ALALSSSRLV PQVGGISHAP AAASATPEAA TLSRWICQSY RFHFGSELIK  720
SADASGCEAG LKALWHHASA ILCCSLKAVP VFTFANQSGL DMLETTLVAL QDITLEKVLD  780
DQGRKNLCAE LPGVMEQGFA CIPGGLCVSG LGRPVSYEKA LAWKVLDDDS GAHCICFMFV  840
NWSFVASM*
Expression -- UniGene ? help Back to Top
UniGene ID E-value Expressed in
Pvr.179040.0root| stem
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_002454995.10.0hypothetical protein SORBIDRAFT_03g002660
SwissprotA2WLR50.0HOX29_ORYSI; Homeobox-leucine zipper protein HOX29
TrEMBLC5XLT30.0C5XLT3_SORBI; Putative uncharacterized protein Sb03g002660
STRINGSb03g002660.10.0(Sorghum bicolor)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MonocotsOGMP37438197
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G52150.10.0HD-ZIP family protein